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Abstract. Properties of Term Rewriting Systems are called modular iff they are pre- 
served under (and reflected by) disjoint union, i.e. when combining two Term Rewriting 
Systems with disjoint signatures. Convergence is the property of Infinitary Term Rewrit- 
ing Systems that all reduction sequences converge to a limit. Strong Convergence requires 
in addition that redex positions in a reduction sequence move arbitrarily deep. 

In this paper it is shown that both Convergence and Strong Convergence are modular 
properties of non-collapsing Infinitary Term Rewriting Systems, provided (for convergence) 
that the term metrics are granular. This generalises known modularity results beyond 
metric doo- 



Modular properties of Term Rewriting are properties that are preserved under (and 
reflected by) the disjoint union of signatures. They are of particular interest to reason 
about rewrite systems by divide and conquer. 

Since Toyama showed that confluence is [22] and strong normalisation is not [23J mod- 
ular, the modularity of many further properties have been investigated. Examples of other 
modular properties are: uniqueness of normal forms [13], weak normalisation, simple termi- 
nation; examples of non-modular properties are: completeness, strong confluence. Modu- 
larity proofs for these properties are sometimes complex [22j, sometimes simple: the modu- 
larity of weak confluence follows directly from the critical pair lemma [2] . Many modularity 
results come with syntactic restrictions, e.g. completeness is modular in the absence of 
non- left-linear rules [21] and strong normalisation in the absence of collapsing rules |16j . 

One aspect of Term Rewriting is that it can be viewed as a computational model 
for Functional Programming. Lazy Functional Programming exhibits a phenomenon that 
ordinary Term Rewriting does not really capture: reductions that converge to an infinite 
result in infinitely many steps. Partly for that reason, the concept of Infinitary Term 
Rewriting was devised in [5]; technically, it arises from the finite version by equipping 
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the signature with a metric dm and using as its universe of terms Ter"^{T,), the metric 
completion of the metric space {Ter{IJ),dm), where Ter[S) is the set of finite terms. 

It is common to only consider the metric doo (which goes back to [T]) in infinitary term 
rewriting, ever since its beginning in [5j; d^o sets the distance doo{t, t') to where k is the 
length of the shortest position at which the two terms t and t' have different subterm-roots. 
This also coincides with a straightforward co-inductive definition of infinitary terms |13j . 
The metric completion of doo plays a special role as it is a final co-algebra and thus contains 
all infinite terms. 

However, this is not the only metric of interest: for instance, the functional program- 
ming language Haskell permits to equip constructors with strictness annotations 0. These 
annotations change the universe of terms by ensuring that certain infinite terms are impos- 
sible to construct. For example: 

data LTree = Null | Bin LTree Int ILTree 
The exclamation mark before the third argument of the constructor Bin labels that ar- 
gument as strict. For Haskell, this means that to construct a value for Bin tl n t2 the 
expression t2 needs to be evaluated first. As a consequence it is impossible to construct 
an LTree with an infinite right spine. The same effect can be achieved by modifying the 
distance function [9j; in this case, we could define 

dm(Bin(a,6,c),Bin(x,y,z)) = m.&yi{dm{c, z),dm{b,y), ^ • dm{a,x)) 

With this, the distance between two finite LTree values would be 1 if their right spines have 
different lengths. Therefore, all Cauchy-sequences would eventually have right spines of a 
fixed finite length with the corresponding implication for metric completion. 

Metrics of this form are exactly the granular metrics mentioned in title and abstract. 
They distinguish between strict and lazy argument positions, which correspond to multi- 
plicative factors of 1 and ^ (respectively) in the components of their associated ultra-metric 
maps. The associated ultra-metric maps tell us how to compute distances between terms 
with the same root symbol, and the components split this task into different argument po- 
sitions. The reasons to treat these metrics separately are (i) the practical interest in them 
and (ii) that their restricted form is particularly well-behaved. 

Infinitary Term Rewriting views transfinite reductions as reduction sequences that are 
also convergent sequences, w.r.t. its metric. In a reduction sequence each term rewrites to 
its successor. An infinitary Term Rewriting System (iTRS) is called convergent iff all its 
reduction sequences converge. 

Before [lOj there were few positive results about convergence in the literature, instead 
research has focussed on strong convergence which in addition requires that redex positions 
in a reduction sequence cannot stay short: if we replace each contracted redex with a random 
term we would still have a converging sequence. 

It is known that strong normalisation is not modular for finitary Term Rewriting Sys- 
tems. The well-known counter example from [23] has a strongly normalising TRS with 
rule F(0, — >• F(x,x,x), and another one with rules G{x,y) — )■ x, G{x,y) — )■ y; their 
combination fails to be strong normalising as the term F{0, 1,G(0, 1)) reduces to itself in 
three steps. As we shall see, for the discrete metric strong convergence and strong nor- 
malisation coincide and therefore modularity of strong convergence can fail for other term 
metrics than doo- The example also shows that "finite convergence", the convergence of 
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all reduction sequences starting from a finite term, fails to be modular in both weak and 
strong form [19]. 

Simonsen |_19] proved that strong convergence is a modular property for left-linear iTRSs 
(for metric doo)', the left-linearity condition is indeed unnecessary [lOj. Proving such results 
for ordinary convergence is more complicated. 

Example 1.1. Ri = {F{x, x, y) F{x, y, x)} R2 = {0 ^ 5(0)} 

Both systems Ri and R2 are converging. In the combination we have reductions like 
F(0,0,0) ^ F(0,0, 1) ^ F(0, 1,0) ^ -F(1,1,0) ^ F(1,0, 1) ^ F(l,l,l), with 1 = S{0). 
If we project position 2 (or 3) from this reduction sequence, thus extracting a sequence of 
elements from the other system then we get a sequence where the values go down as well as 
up, i.e. on the partial order induced by R2- 0,0, 1, 1,0, 1, 1, . . . For an overall convergence 
proof we need to demonstrate why that projected sequence is supposed to be converging, 
and that is not trivial. 

If a reduction sequence in the disjoint union of two iTRSs is converging but not strongly 
converging then its top layer "remains active" throughout the sequence and in particular it 
can rearrange its principal subterms, as happened with and 1 in Example ll.il Informally, 
principal subterms are those subterms where first a change of signature occurs. 

These rearrangements are strongly constrained though by the fact that the iTRS as- 
sociated with the top layer is itself converging. In particular, it can be shown that the 
subterms at a principal position in the limit form a so-called focussed sequence; these have 
peculiar properties: (i) any infinite subsequence of a focussed sequence contains an infinite 
weak reduction sequence; (ii) any non-converging focussed sequence contains a subsequence 
which is a non-converging weak reduction sequence. Here, "weak reduction sequences" are 
reduction sequences with gaps: an element of such a sequence does not necessarily reduce 
to its neighbour in a single reduction step, this can take any number of steps. For ab- 
stract reduction systems over Hausdorff spaces, convergence of all reduction sequences and 
convergence of all focussed sequences are equivalent properties (Proposition 13. Sp . 

This implies that any reduction sequence in the combined iTRS must converge "layer 
by layer", and therefore is converging w.r.t. the metric doo- Thus such a sequence can 
only fail to converge if the infinite term the sequence is converging to (under metric doo) 
does not exist in the metric completion derived from the combined system's own metric 
dm- For example, if one system has the rule F{x) — t- G{x) and the other a unary function 
symbol H then the infinite term t = F{H{t)) may exist under the combined metric whilst 
u = G{H{u)) may not, in which case the combined system would fail to converge. In 
granular term metrics such rules can be dismissed as they instantly break convergence; in 
the example, a = F{a) would and h = G{h) would not exist. 

2. Preliminaries 

To make the terminology of this paper self-contained, this section contains a reminder 
of some basic definitions and facts about topological and metric spaces [H H], as well as 
sequences over them. 

A topological space is a pair (5, O) where S" is a set and O a subset of p{S) such that it 
is closed under finite intersections and arbitrary unions, and 0, S € C The elements of O 
are called open sets, their complements w.r.t. S are called closed sets. A function f : A ^ B 
between topological spaces is called continuous iff f~^{X) is open whenever X is open. 
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An open base B is a subset of p{S); its associated topology O is the smallest set such 
that B O and that (S, O) is a topological space. 

A neighbourhood of x G S" is a set N <^ S for which there is an yl G C such that 
X €z A <^ N . A topological space is called Hausdorjf iff any two distinct elements of S have 
disjoint neighbourhoods. It is called discrete if O = p{S). 

A sequence in a topological space {S, O) is a continuous function f : a ^ S, where a 
is an ordinal. The topology on ordinals a is defined as follows: for every x < a the sets 
{y \ y < x} and {y \ x < y} are open and form an open base, i.e. all other open sets arise 
through finite intersections and arbitrary unions of these sets. The sequence / is called 
open if a is a limit ordinal, otherwise it is closed. 

An open sequence f : a ^ S diverges iff it cannot be extended to a sequence /' : 
a + 1 — )• S where V7 < a. /'{"f) = fi'j)- Otherwise it converges. For example, if S is the set 
{0, 1} and f : uj ^ S is given as /(n) = n mod 2 then / diverges for the discrete topology 
on 5 (O = but it converges on the other 3 topologies one can define on S. 

A subsequence of / is a strictly monotonic and continuous function g : f3 ^ a. g is 
called cofinal [7] iff V7.7 < a =^ ^C-C < f3/\g{C) ^ 7- We say that a subsequence g converges 
iff the sequence fog does. Clearly, an open sequence converges iff all its subsequences do. 
Note: (i) cofinal subsequences compose: if (7 is a cofinal subsequence of / and h a cofinal 
subsequence of 51 then gohis a cofinal subsequence of /; (ii) subsequences are strictly normal 
functions on ordinals [15], i.e. their application distributes over suprema. 

An open sequence f : a ^ S converges to c iff it stays eventually within any neigh- 
bourhood of c. Notice that this notion coincides with a neighbourhood-based definition of 
being "continuous at a" which implies that if / converges to c then it converges, and dually 
if it converges it must converge to some value. 

A metric space is a pair (M, d) where M is a set and d : M x M ^ TZ is a distance 
function with the properties d{x, y) = <^=^ x = y and d{x, y) + d{x, z) > d{y, z) 
which implies that d is commutative and returns only positive values [4]. Metric spaces are 
topological spaces: j4 C M is open iff Vx G A.3e > 0. Vy G M. d{x, y) < e ^ y £ A. Metric 
spaces are Hausdorff. A special case of metric spaces are ultra-metric spaces which have a 
stronger triangle inequality: max{d{x,y),d{x, z)) > d{y,z). 

The diameter of a subset A C M is the supremum of all distances between elements 
of A. The diameter of an open sequence / : a — >• M is the smallest e > such that 
V/3 < a. 371,72 > f3. d(/(7i), /(72)) > e. 

A Cauchy-sequence over (M, d) is a function / : w — Af such that Ve > 0. 3k. Vm, n > 
k. d{f{m), f{n)) < e. Thus Cauchy-sequences are precisely those with a diameter of 0. The 
metric space is called complete iff all its Cauchy-sequences converge; each metric space M 
has a completion M', which is unique up to isomorphism. In infinitary term rewriting, 
metric completion is used to generate infinite terms from finite terms. A function / : M — )• 
between metric spaces (M, ^m) and (A^, d^) is called uniformly continuous iff Ve > 0.36 > 
0. Vx G M. My G M. dM{x,y) < 6 ^ d]\f{f{x),f{y)) < e. A uniformly continuous function 
f : M ^ N has a unique continuous extension to M* — t- A^* . Continuous functions preserve 
converging sequences, uniformly continuous functions also preserve Cauchy-sequences. A 
special case are non-expansive functions, characterised by the property dN{f{x),f{y)) < 
dM{x,y). If even dN {f (x) , f (y)) = dM{x,y) then / is an isometric embedding. Isometric 
embeddings both preserve and reflect Cauchy-sequences, and — if the spaces are complete 
— converging sequences. 
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3. Reduction systems and convergence 

A topological abstract reduction system (short: TARS) is a structure (5, O,— t-) such 
that (5, O) is a topological space and {S, — an abstract reduction system, i.e. — )■ is a 
binary relation on S. In the world of infinitary terms the underlying topological spaces are 
typically ultra-metric spaces, see O Proposition 3] and [201 Section 4.3]. The reason for 
the added generality of this section is that (i) the notion of convergence is a topological 
concept, (ii) certain convergence results about abstract reduction systems can be proven 
at this more general level, and (iii) not all forms of infinitary rewriting derive from metric 
spaces, e.g. for infinitary A-calculus Bohm- Trees relate to the Scott-topology [3j. 

A reduction sequence in a TARS (5, O, — >•) is a sequence f : a ^ S such that V/3./3 + 1 < 
a =^ f{(3) — f{(3 + 1). Recall that being a sequence entails the continuity of /. 

For infinitary term rewriting we are specifically interested in TARS where the underlying 
topological space is a metric space. This gives rise to the notion of a metric abstract reduction 
systems (short: MARS) (M, d, — t-) where (M, d) is a metric space, and — t- a relation on M. 

A TARS is called convergent iff all its open reduction sequences converge. Otherwise 
we call it divergent. 

Proposition 3.1. If the open sequence f : a ^ S diverges then for every s G S* there is 
a neighbourhood Ns, an ordinal f3s and a cofinal subsequence Wg fig ^ a of f such that 
yj<(3s.f{wsh))^Ns. 

Proof. Otherwise, there would be an s (possibly many) for which these premises are not 
true. In that case / converges to s. □ 

For topological spaces that are not Hausdorff it is possible that sequences converge to 
more than one value. For example, consider co with the cofinite topology: a subset A C u; 
is open iff either A is empty or a; \ ^ is finite. In that topology the identity function on lo 
converges to all numbers, because the sequence 1, 2,3,4,... will eventually stay within any 
cofinite set of natural numbers. 

Weak reduction of a TARS is defined as follows: t u iff there exists a closed 
reduction sequence / : a + 1 — )• with f{0) = t and f{a) = u. Clearly, the relation is 
transitive. We also write -^ww for the weak reduction of the TARS (5, O, — »to). Moreover, 
we call reduction sequences of the TARS (5, O, -^w) weak reduction sequences of the original 
TARS (5", O,— )•). These are the earlier mentioned "reduction sequences with gaps". 

Proposition 3.2. Let (S, O,— t-) be a convergent TARS, where {S,0) is Hausdorff. If 
t ^ww ^ then t u. 

Proof. We can prove the result by induction on the indexing ordinal a of the witnessing 
sequence / : a + 1 — >■ S" for i -^ww u- The base case a = is trivial and if a is a successor 
ordinal the result follows by the induction hypothesis and transitivity of 

Otherwise, a is a limit ordinal. For any f3 < a we have t fW) s^nd thus by 

the induction hypothesis t /(/?)• This reduction has an associated indexing function 
9/3 '■ 1/3 ^ S. The increasing sequence of ordinals 7^3 (with /3 approaching a) must converge 
to some ordinal 7 \n\ page 290]. W.l.o.g. we can assume that this is a limit ordinal as 
otherwise almost all reductions in the image of / would be empty. Moreover the functions 
gp agree on their common domain. In their limit they thus extend to a function 5 : 7 — >• 5 
which is an open reduction sequence. As the original TARS is convergent g can be extended 
to a closed sequence 5' : 7 + 1 — S*. Since both /(a) and 5' (7) are limits of the sequence 
/(/3) with /3 approaching a we must have 5' (7) = /(a) and the result follows. □ 
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It is easiest to explain the meaning of Proposition 13.21 by showing how the property 
fails if we drop the condition that the TARS is convergent. Consider the following example, 
given as an infinitary string rewriting system: 

Example 3.3. {BE CSE, AC AB, BS SB, SC CS} 
In this system we have for any n: 

ABS'^E AS'^BE AS'^CSE ACS'^^^E ABS'^^^E 

Thus ABS^E ^* ABS'^+^E and also ABS''E -^^ ABS'^'+^E. Starting from n = 
we get ABE -^ww ABS°° as the result is the limit of all ABS^E. However, we do not 
have ABE ABS°° as intermediate reduction results in the sequences for ABS^E 
ABS^^^E will change the B to an S. The example does not contradict the proposition as 
the system is not converging: we have ABS^E AS^BE ABS^~^^E -^^ .... 

The requirement that the topological space is Hausdorff is not merely used in (the last 
part of) the proof, the property can indeed fail for non-Hausdorff-spaces. 

Example 3.4. We define a TARS E with carrier set ui and the following sets as an open 
base: 

* Sn = {m \ m > n}, for any n € uj 

• the set of all even numbers 

Its relation — t- is given as n — )• n + 1, for all n. 

For the topology of E every element has a smallest neighbourhood, which is subset of 
all its neighbourhoods. For an odd number n it is the set S„, for an even number n it is Sn 
intersected with the set of all even numbers. 

Since Si = uj \ {0}, sequences converge to 1, unless they have a cofinal subsequence 
that is constantly 0. In that case the sequence might converge to 0, but only if it does 
not contain a cofinal subsequence in which all numbers are odd. As a whole the TARS is 
convergent as all its reduction sequences converge to 1. 

But the proposition about the weak reduction relation does not hold here: we have 
2 -^ww but not 2 0. The reason is that the reduction sequence 2 — )• 3 — )• 4 — )• • • • 
converges to all odd numbers, but to no even numbers. By omitting every other element the 
sequence gives rise to the weak reduction sequence 2 —^w 4 —^w 6 ■ ■ ■ which converges 
to all numbers, including 0; but there is no reduction sequence starting from 2 that would 
converge to 0. 

Aside: it is possible that and -^ww coincide without (5, O, — )■) being convergent — 
e.g. we can choose = S x S. 

Proposition 3.5. Let (S, O,— )■) be a convergent TARS where {S,0) is Hausdorff. Then 
{S,0,^w) is also convergent. 

Proof. Using the same construction as in the proof of Proposition 13.21 we can expand an 
open — »^-reduction into an open — )-reduction. The convergence assumption gives us a limit 
to the latter which must also be a limit to the former. □ 

As the proof is based on Proposition 13.21 we still require that the topology is Hausdorff, 
and the property can indeed fail for non-Hausdorff spaces, as Example 13.41 provides again a 
counterexample: we can define a weak reduction sequence of length u as follows: — ?■ 1 — )• 
2 -^y^ 4 -^yj 6 -^uj . . . which converges to 0. By iterating this reduction we can construct a 
weak open reduction sequence f : uP' ^ uj such that for all limit ordinals A < o;^ we have 
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/(A) = A /(A + 1) = 1 A /(A + n + 2) = 2n + 2. This fails to converge under the topology 
of E, as both g : uj ^ uj'^,g{n) = uj ■ n and h : uj ^ uj'^, h{n) = uj ■ n + 1 are cofinal; fog 
and f oh are constantly and 1, respectively, and thus miss the smallest neighbourhoods 
of all other numbers. 

Corollary 3.6. A TARS is convergent if and only if all its weak reduction sequences con- 
verge, provided its topology is Hausdorff. 

Proof. If the TARS is not convergent then the result follows because any reduction sequence 
is also a weak reduction sequence. Otherwise Proposition [331 applies. □ 

Given a TARS {S, O, — >■) and a sequence f : a ^ S, the predicate Foc/^a on a is defined 
as follows: 

Foc/,„(/3) ^ V7.a>7>/3^3C.a>CAVK.a>/t>C^/(7)-^«;/(^) 
We have: (i) if Foc/,„(/3) then Foc/,„(7) for all 7 > /3; (ii) if Foc/,„(/3) then 37 > /3./(/3) 
7(7). The sequence / is called focussed iff 3/3. Focj^q(/3). 

In words: a sequence is focussed iff every of its elements sufficiently close to a weakly 
reduces to all its elements sufficiently close to a. In particular, all weak reduction sequences 
are focussed, but not vice versa. 

Of interest are specifically open focussed sequences. Closed sequences (a = a' + 1) are 
trivially focussed: to prove 3/3.Foc/^q(/3) we can choose (3 = a' which determines 7 = a', 
and C can also be chosen as a' which in turn determines k = a', and /(a') — holds 
by reflexivity of -^w 

Proposition 3.7. Every cofinal subsequence g of a focussed sequence f has itself a cofinal 
subsequence h such that f o g oh is a cofinal weak reduction sequence. 

Proof. Proposition 4 in [10] claimed this for MARS, but the proof works unchanged for 
TARS, with no constraints on the topology. □ 

Proposition 3.8. A TARS converges iff all its focussed sequences converge, provided the 
topology is Hausdorff. 

Proof. Since all weak reduction sequences are focussed the <^ implication follows from 
Corollarv 13.61 Now suppose / : a — )• 5 is a divergent focussed sequence; from it, we need 
to construct a divergent reduction sequence, and by Corollary 13.61 it suffices if it is a weak 
reduction sequence. 

Take any element c G 5. Proposition 13.11 gives us a cofinal subsequence Wc ■ /3c ^ a 
of / witnessing the lack of convergence of / to c. We can apply Proposition 13.71 to / and 
Wc, which gives us a cofinal subsequence he ■ ^ /3c such that gc = f o Wc o he is a weak 
reduction sequence. 

If gc is divergent we are done. Otherwise it converges to a limit e. Again, we have a 
cofinal subsequence We witnessing that / does not converge to e. From this we can construct 
another cofinal weak reduction sequence h that alternates between elements from gc and 
f o We. This cannot converge to e because of the cofinal overlap with f o We, but if h 
converged to any other e' then because of the cofinal overlap with gc all neighbourhoods of 
e and e' intersect, contradicting the Hausdorff proviso. □ 
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The argument of the proof is essentially the same as in [lOj, but reworded to avoid 
reference to distances. In [lOj it was said that it was "not clear" if the proposition could be 
generalised to non-Hausdorff spaces. It actually cannot, as our earlier counterexample to a 
generalisation of Proposition 13.51 would also a refute a generalisation of Proposition 13.81 

4. Finite Terms and Infinitary Terms 

A signature is a pair S = (J^, ^) where is a set (of function symbols) and # : — t- 
is the function assigning each symbol its arity. We assume an infinite set Var of variables, 
disjoint from J^. The set of finite terms over S is called Ter{S) and it is defined to be 
the smallest set such that (i) Var C Ter{S) and (ii) F(ti, . . . ,tn) G Ter(I!) whenever 
F £ J- A = n A {ti, . . . C Ter{S). The root symbol of a term F{ti, . . . ,i„) is F, 

the root symbol of a variable x is x. 

We write M* for the free monoid over the natural numbers (i.e. finite words), with 
neutral element A and infix • as monoid multiplication. The set of positions Pos(t) of a 
finite term t is the smallest subset of M* such that: (i) A G Pos(t), (ii) {i ■ q \ 1 < i < 
n,q £ Pos(tj)} C Pos(F(ti, . . . ,tn))- Positions are used for selecting or replacing subterms. 
For selecting the subterm at position p of a term t we use the notation t/p. We extend the 
domain of this function by setting t/p = x'lip ^ Pos(t) which allows to extend this notation 
pointwise to sequences of terms: {f /p){a) = f{a)/p. For replacing the subterm at position 
p of a term i by n we use the notation t[u]p; if p ^ Pos(t) then we set t[u]p = t. Positions 
are partially ordered by the prefix- ordering: p ^ p ■ q. 

A T,- algebra is a set A together with functions F^ : A" — )• A for every F £ F with 
#{F) = n. A valuation into A is a function p : Var — )• A. Any E-algebra A determines an 
interpretation function : Ter{X!) x { Var — )• A) — )• j4 as follows: 

|a;]^ = p{x), if X € Var 

IF(ti,...,t„)i^ = F^([tii^,...,[y^) 

Given two S-algebras A and B, a S-algebra homomorphism from A to is a function 
h : A ^ B such that /i(F4(ai, . . . , a„)) = Fsih^ai), . . . ,h{an))- If h : A ^ B is a 
homomorphism then /i([i]^) = [^1^°^- 

A context is a pair of a term t and a position p £ Pos(t). If C[ ] is the context {t,p) 
and u a term the context application C[u] is defined as t[u]p. A substitution is a function 
9 : Var — )• Ter{S) which is uniquely extended to a function 9 : Ter{S) — t- Ter[S) by 
requiring that ^ is a S-algebra homomorphism. 

An ultra-metric map (short: umm) is an n-ary function / : [0, 1]" — t- [0, 1] such that 
(i) /(xi, . . . , x„) = <^=^ xi = A • • • A x„ = and (ii) it distributes over max: 
/(max(xi,yi), . . . ,max(x„,y„)) = max(/(xi, . . . , x„), /(yi^ . . . , y„)). Each n-ary umni / 
can be expressed in the form /(xi...,x„) = maxi<j<„ /j(xj), where the functions : 
[0,1] —7- [0,1], called the components of /, are also umms. A term metric for a signature 
S is a S-algebra m with carrier set [0, 1], such that for each F £ F the function Fm is a 
umm of the same arity. A term metric is called continuous iff all its ultra-metric maps are 
continuous. In this paper, we shall only consider continuous term metrics. 

For every term metric m, the distance function dm '■ Ter{S) x Ter{S) — )• [0, 1] is de- 
fined as follows: dm{t,t) = 0, dm{t,u) = 1 if t and u have different root symbols, and 
dmiF{ti, . . . ,tn),F{ui,. . . ,Un)) = (dm(ti , ui ),..., (t„, «„)) . Some fundamental re- 
sults about term metrics from [9j ctr6i dm is ultra-- ixLGtricj the topology inducGci by drn Is 
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discrete. Moreover, the set of infinitary terms over S and m is called Ter"^{T,) and defined 
as the metric completion of the metric space { Ter{IJ),dm)- Ter'"(S) is a S-algebra if m is 
continuous. 

Each position p in Pos(t) has a unique umm {t,p)m associated with a term metric m: 

{t,X)m = id 

{F{ti, . . . ,tn),i ■ q)m = Fm,i° {ti,q)m 

This associated umm gives us the distance function for position p, in the sense that: 
dm{t[v\p,t[s\p) = {t,p)m{dmiu, s)). As the definition only depends on a finite part of t 
it is even defined for infinite terms t which are not in T'er™'(S). 

The notions of contexts, substitutions, positions and the operations on them lift canon- 
ically from Ter{S) to rer™'(E), at least for continuous term metrics, see |9]. 

Proposition 4.1. The subterm replacement operation t[u\p is non-expansive in t, for any 
term metric. 

Proof. By induction on the length of p. Base case: dmit[u]x,t'[u]\) = dm{u,u) = < 
dm{t,t'). Otherwise p = i ■ q. If t and t' have different root symbols then dm{t,t') = 1 > 
dm{t[u]p, t'[u]p). Finally, let t = F{ti, . . . ,tn) and t' = F{t[, t'J. Then 

dm{t,t') = max Fm,j{dm{tj,t'.)) = max{6, Fm,iidm{ti,t'i))) 

l<j<n 

where 5 = maxi<j<nj^i Fm,jidm{tj,t'j)). Moreover, 

dmit[u\i.g,t'[u]i.g) = max{5, Fm,iidm{ti[u]g,ti[u]g))) 

Induction hypothesis on q gives us: dm{ti,t[) > dm{U[u\g,t'^[u\g), so the result follows by 
monotonicity of the functions Fm,i (as they are umms) and max. □ 

Aside: subterm replacement t[ii\p is non-expansive in u if and only if the term metric 
m is non-expansive, i.e. if all functions F^ for F £ are non-expansive. 

4.1. Characterising Convergence in Term Metric m. The e-positions of a term t 
w.r.t. to term metric m, the set Pos™(t), is defined as {p G Pos(t) | > e}- Clearly, 

Pos™(t) is prefix-closed: ii q-w =p then {t,q)m{l) > {t,q)jn{{t/q,w)rn{l)) = {t,q-w)rn{'\) > 
e. 

The properties of Pos™(t) can be used to characterise whether an infinite term (from 
rer°°(S)) exists (in Ter^Ji;)). 

Proposition 4.2. If t £ Ter"^{T>) then Pos^(t) is always finite. 

Proof. Let t^ G Ter[S) be a finite term such that dm{t,te) = rji < e — such finite terms 
always exist, by construction of the metric completion. 

Let p G Pos^(t) and y £ Var be fresh. Then dm{t,t[y]p) = {t,p)m{l) > e, by 
definition of Pos™(t). We also have dm{t[y\p,tf[y\p) = < rji < e, because subterm 
replacement is non-expansive. Thus, using the ultra- metric properties of dm, we know 
e < (iiP)m(l) = dmii,t[y]p) < max(?7i, max(dm(te, r/2)). As both rji are smaller than 
e this is equivalent to dm{t,t[y]p) < dm{te,t^[y]p), and the dual argument shows that these 
distances must be equal. As that value is non-zero, p must be in Pos(te). This shows 
Pos^(t) C Pos(te) and as the latter is finite, so is the former. □ 
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Thus, for any infinitary term under any term metric there are only finitely many subterm 
positions which — if changed — could change the overall term by e or more. This observation 
plays a role when we reason about divergence. 

An immediate consequence of Proposition 14.21 is that if Pos^ (t) is not finite (for any 
e > 0) then t ^ Ter™(S). This gives us a criterion to rule out certain infinite terms. In 
fact, the criterion is complete: 

Proposition 4.3. Let t G Ter°°(S). // Pos™(t) is finite, for all e, then t G rer'"(S). 

Proof. We need to create a Cauchy-sequence (w.r.t. dm) of finite terms converging to t. To 
make the n-th term tn of the sequence to be within distance of 1/n of the limit t we obtain 
it by replacing all subterms of t in minimal (w.r.t. ^) positions not in Pos™(t) (for e = 1/n) 
by the variable x. As Pos^(t) is finite its elements have a maximal length k and the longest 
position in t„ is (at most) of length A; + 1, and thus by Konig's Lemma t„ must be finite 
too. □ 

An important concept when studying convergence is the equivalence relation that two 
terms have "the same function symbols up to position p". Formally, this is defined by 
induction on the length of the positions as follows: 

F{ti, . . . ~ F(ni, . ..,Un) <S= ti ~ Ui 

Clearly, if a ~ 6 then {a,p)m = {b,p)m', see also [9]. 

Traditionally, infinitary term rewriting has focussed on one particular distance function 
doQ- The same distance function arises through the definition of a term metric: we define 
the term metric oo in which all n-ary ultra-metric maps are given as oo„(xi, . . . , x^) = 
i • max(xi, . . . ,Xn)- It plays a fundamental role as rer°°(S) includes all infinite terms: 

Proposition 4.4. Any Cauchy-sequence in {Ter{S),dm) is also a Cauchy-sequence in 
{TeriS),d^). 

Proof. Let f : lo ^ Ter{E) be that sequence. 

Assume it was not Cauchy under metric doo- Then there is a maximal (under the prefix 
ordering) position p such that f /p changes its root symbol infinitely often. By maximality, 

the symbols above p are fixed for all sufficiently large k, i.e. f{k) ~ f{k + 1), and infinitely 
often have f{k)/p and f{k + l)/p different root symbols. 

This implies (see proposition 6 in [9j) that there is a fixed umm Cm such that for all 
sufficiently large k 

dmifik), f{k + 1)) > Cm{dm{f{k)/p, f{k + l)/p)). 

Whenever we have a change of function symbol at position p the expression on the right 
becomes the fixed value Cm(l). As this happens infinitely often we have a contradiction to 
the pre-condition that / is Cauchy under metric dm- Therefore the assumption was wrong 
and / is Cauchy under doo as well. □ 

Corollary 4.5. For any term metric m, there is a unique continuous extension of the 
identity function from finite terms to Ter'"(S) — ?■ Ter°°(E). 

Proof. The identity is continuous on finite terms between any two term metrics, because the 
topologies on finite terms are discrete for any term metric (proposition 4 in ^). Proposition 
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14.41 allows to extend the function to infinite terms with domain rer™'(S), and such extensions 
to metric completions are always unique when they exist. □ 

Proposition 14.41 shows that Ter°°{Ti) "contains all infinite terms", because any infinite 
term arises as the limit of a Cauchy-sequence under some metric dm, and the corollary 
indicates that the construction is continuous. Thus we can view any element of Ter^(Y^) 
also as an element of Ter°°(T,). It also means that any sequence in T'er™'(S) that fails to 
converge under doo will necessarily also fail to converge under dm- 

Aside: for finite signatures there is a simpler argument to establish Corollarv 14.51 the 
identity function is uniformly continuous (from dm to doo)- However, this is no longer true 
for infinite signatures, e.g. if a signature contained infinitely many unary function symbols 
F{n) with umms F{n)m{x) = ^. 

Proposition 4.6. Let f he a sequence over Ter"^{T,) such that f is converging under metric 
doo to t €z Ter™'(S). Then f is also converging to t under metric dm- 

Proof. As t G Ter™(S), the set Pos™(t) is finite for any e. As / is converging to t under 
doo, there must be a 70 such that V7 > 70. Vp € Pos™(t). t ~ /(7). This implies that t and 
all 7(7) are within e-distance, w.r.t. to metric dm- Thus / stays eventually within the e-ball 
of t, and / must converge to t as this holds for any e. □ 

Therefore, if a sequence of finite terms / converges under metric doo to a limit t G 
Ter°°(S), the question whether / converges under dm reduces to the problem whether t £ 
rer™'(S) or not. If t G rer"^(E), Proposition 14.61 applies and so / converges; if t ^ Ter"^{T,) 
then / cannot converge under dm, because if it did converge to t' ^ t then / would converge 
under doo to both t and t' — which is impossible in a Hausdorff space. 

Let t G Ter°°{T,). An infinite path of t is a strictly monotonic function f : u ^ Pos(t), 
i.e. n < k ^ fin') ^ /(^)- The path / is called full iff the length of f{n) is always n. We 
say that / converges w.r.t. term metric m iff the sequence of umms (t, f{n))m converges to 
the constant function. 

Lemma 4.7. Let f he an infinite path of t. Then f converges w.r.t. m iff the sequence 
(t, f{n))m{^) converges to 0. 

Proof. If {t,f{n))m converges to the constant function then it will converge to when 
applied to argument 1. If (t, /(n))m(l) converges to then by monotonicity {t, f{n))m{e) 
must converge to at (at least) the same rate, and therefore {t,f{n))m converges to the 
constant function. □ 

Proposition 4.8. Let t £ Ter°°(S). Then t G rer"^(S) iff all infinite paths oft converge 
w.r.t. term metric m. 

Proof. Suppose path / does not converge. Then by 14.71 the sequence (t, /(n))m(l) does not 
converge to 0. Thus, (t, /(n))m,(l) > e > for infinitely many n. As each of these is in 
Pos^(t), this set is infinite and t ^ Ter™(S) follows from Proposition 14.21 

Now assume t ^ Ter™(S). Then by proposition 14.31 Pos^(t) must be infinite for some 
e. We can iteratively construct a full non-converging infinite path through Pos™(t) by the 
pigeon hole principle: set /(O) = A; as the symbol at t/ f[n) has finite arity, there must be 
at least one i such that infinitely many positions in Pos™(t) have f{n) ■ i as prefix, and we 
can set f{n + 1) = /(n) • i. As all values in the range of / are in Pos™(t), the sequence 
(t,/(n))m(l) stays above e and is therefore not converging to 0. □ 



12 



STEFAN KAHRS 



5. Infinitary Term Rewriting 



An infinitary term rewriting system (iTRS) is a triple (S, m, R) such that (i) S is a 
signature, (ii) m is a term metric for S, and (iii) R C rer™'(S) x Ter™(S) is a set of rewrite 
rules such that for all {l,r) £ R we have I ^ Var and all variables in r also occur in I. We 
write / — 7- r if (l, r) S R. 

Note: this definition permits infinite terms on both left-hand and right-hand sides of 
rules, as well as repeated variables in the left-hand side. For better continuity properties 
of the rewrite relation one would typically require in addition that each / is a finite linear 
term. Moreover, |9j required all rules to be depth-preserving (see below). However, for the 
purposes of this paper these restrictions are not necessary. 

A relation S on Ter''"(S) is called substitutive iff s S t implies 9{s) S 9{t), for any 
substitution 9. It is called compatible iff s 5 t implies C[s] S C[t] for any context C[ ]. 

The single-step rewrite relation — t-r of an iTRS {T,,m,R) is defined as the compatible 
and substitutive closure of R, i.e. C[9{1)] -^r C[9{r)] whenever (/,r) £ R, C[] is a context, 
and ^ is a substitution. 

Aside: there is a slight issue with the definition of "compatible" . For better continuity 
properties one could require in addition that a compatible relation should be reflexive on 
infinite terms. If t is an infinite term then a rewrite step t[l]p — t-zj t[r]p should vanish in 
the limit, as p goes arbitrarily deep. This is necessary to ensure that the relation — t-jj is 
upper semi-continuous, and that it is closed. For the purposes of this paper the distinction 
is immaterial as the presence of reflexive rewrite steps does not affect the convergence 
relations. 

We associate with an iTRS (5],m,,i?) the MARS ( T'er'"(S), dm, —>•/?); in particular, the 
iTRS is called convergent iff its associated MARS is. 

In the following we will be using a generalisation of the notion of depth-preservation 
|12j to arbitrary term metrics. In term metric oo, the depth of a variable x in a term t is 
the length I of the shortest position pint such that t/p = x. Equivalently, that information 
is also present in the distance doo{t,t[y /x]) = (t,p)oo(l) (for fresh y), because that distance 



For other term metrics that figure is insufficient to fully capture the notion of depth of 
a variable in a term, because it only tells us what happens with distances when terms 
with different roots replace the x. More generally, we are interested in the distances 
dm(t[u/x],t[s/x]) for varying u and s; the dependence on u and s can be replaced by a 
dependence on dm{u, s). Thus the generalised variable depth of x in t w.r.t. term metric m, 
vdepth{x,m,t), is an ultra-metric map of type m ^ m, given as: 



A rewrite step t ^ u preserves the depth of x iff vdepth{x,m,t) > vdepth{x,m,u), in the 
pointwise order on Var — )■ m. Overall, a relation R on Ter™(S) is called depth-preserving 
iff Vt, M. t -R u =^ [t] > [u] which means that all its rewrite steps preserve the depth of all 
variables. Here, the order > is the pointwise order on the function space, inherited (twice) 
from m. 

An iTRS (for term metric oo) is strongly converging iff for all open reduction sequences 
of length a no redex position p is reduced arbitrarily close to a. Equivalent to this [23] is the 
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'depth{x,m,t){y) = [t]^*^^ where p{y){z) 
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notion of top-termination, i.e. that no reduction sequence contracts a root redex infinitely 
often. 

It is possible to define the property without referring to redex positions though: the 
indirected version of an iTRSs (S, m, R) is defined as follows. Its signature is S extended 
by a fresh unary function symbol /; for each rule Z — )• r in i? it has a rule / — )• I{r); in 
addition, it has the rule I{x) — )• x. We extend the term metric m by setting Im{x) = x. 

Proposition 5.1. An iTRS over term metric oo is strongly converging iff its indirected 
version is converging. 

Proof. Any reduction sequence / of the original iTRS can be mapped into a reduction 
sequence /' of the indirected version by splitting the step into two parts, where the second 
part removes the just introduced /-symbol. If / is not top-terminating then /' fails to 
converge, as it would have infinitely many changes of root-symbol. 

Dually, a reduction sequence g of the indirected version can be mapped into a (shorter) 
sequence g~^ of the original system, by removing all /-symbols. However, g can only fail to 
converge by an infinitely often occurring function symbol change at some minimal position 
p. That means though that g must apply rules other than /(x) — )• x at position p infinitely 
often, and thus g~^ would apply those rules at p or higher infinitely often which means that 
g~^ is not strongly converging. □ 

The reason that this proposition is limited to term metric oo is simply that the notion 
of strong convergence is only defined for metric d^o. The idea behind strong convergence is 
that redex contractions must contribute less and less to the diameter of a sequence as redex 
positions go deeper. For other term metrics the movement of the redex positions alone does 
not suffice, similarly as modifications of the metric for the infinitary A-calculus page 
709f] affect the notion of "root-active" . 

For other term metrics than oo we define strong convergence as the convergence of the 
indirected iTRS. 

Consequence: consider the term metric id with Fi(i{xi, . . . , x„) = max(xi, . . . , x„) for 
all function symbols F. This is a discrete metric and there are no infinite terms: Ter^'^{T,) = 
Ter{S). Thus, convergence means that reduction sequences are eventually constant, and 
strong convergence coincides with strong normalisation. 

This also shows that strong convergence is not in general a modular property, simply 
because strong normalisation is not. However, strong normalisation is modular in the 
absence of collapsing rules so we will consider the corresponding constraint here. 

6. Examples, Counterexamples and Near Counterexamples 

In this section we will be looking at examples that show why the modularity results for 
convergence come with side conditions. The examples here are using metric doo] to consider 
other metrics at all we would have to first explain what the disjoint union of two iTRSs 
with arbitrary metrics is, see the next section. 

6.1. Collapsing Rules. Particularly problematic w.r.t. convergence are collapsing rules, 
i.e. rules that have a variable as their right-hand side. An iTRS immediately fails to be 
strongly convergent if a collapsing rule is present. The reason is this: any such rule takes 
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the form C[x, . . . , x] — )• x, the equation t = C[t, . . . , t] has a (unique) solution in rer°°(S), 

and t \ t gives an infinite reduction sequence with contractions at the root. 

The weaker notion of convergence can coexist with collapsing rules, but barely: if 
C[x\ —7-* X holds for any other contexts C[ ] than those of the form C[y\ = F'^{y) (for varying 
n but fixed F) then such an iTRS is not convergent either. In particular, the presence of 
two rules F{x) — >• x and G{x) — )• x means that the terms t = F{G{t)),u = G{F{u)) reduce 
to each other and thus give rise to a non-converging reduction sequence. The example also 
shows that convergence (on its own) is not a modular property. 

Convergence can also be broken if only one of the two systems has a collapsing rule. 

Example 6.1. System R has rule G{H{x)) — )• G{x), system S has the rule F{x) — )• x. 
In the combined system there is the term t = F(H(t)) and the non-convergent reduction 
G{t) G{H{t)) G{t). 

6.2. Building Infinite Results. Typically we can observe convergence (or its absence) at 
cj-indexed reduction sequences. In the presence of non-left-linear rules this can be rather 
different though. 

Example 6.2. (from ^) {E S{E), F S{F), G{x,x) G{E,F)} 
This fails to converge, but this failure first occurs at w^-indexed reduction sequences. We 
have G{E, F) ^* G(5"(^), 5™(F)) for any m, n and in the limit G{E, F) G{S^,S^) 
G{E,F). This does not provide any convergence problems at ordinals of the form w • k 
though; however, when we attempt to extend a reduction sequence to cj^ we find both 
G{E,F) and G{S'^ , S°°) in the image of the reduction sequence at any of its neighbour- 
hoods. 

This is a common pattern for establishing non-convergence: any sequence of length a 
that begins and ends in the same value, but that visits a different value in between, gives 
rise to a non-converging sequence of length a; • q by simply repeating that sequence ui times. 
That construction works in any Hausdorff space. 

Example 6.3. In Example 16.21 we had E — ;>* S'^{E) for any n, and E S°° in the limit. 
However, one can have an iTRS with the former but not the latter property, e.g. if we 
replace the first rule by these four: 

{E^ Z, E^ H{E), H{Z) S{Z), H{S{x)) S{S{x))} 

Patterns like this complicate the reasoning about (weak) convergence, because in a combined 
system an outer layer may place in a position p different reducts of E, and thus the term 
in position p could converge at a limit to S°°, showing that a principal subterm at a limit 
may not be a reduct of an earlier principal subterm. This happens when we combine this 
system with the following: 

{J{K{x,y)) ^ J{y)} 

Let t = K{E,t) and u = K{S'^,u): we have t = K{E, K{E, . . .)) K{Z, K{S{Z), . . .)), 
and thus J{t) ->^w JiK{Z, K{S{Z), . . .))). Continuing the reduction from there by repeat- 
edly applying the rule J{K{x,y)) — )• J{y) we have overall J{t) J{u). 

What is remarkable about this is that E is the only principal subterm of J{t) and it does 
not reduce to S°°, the only principal subterm of J{u). However, this second iTRS already 
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fails to converge: given the term s = K(J(K{x, y)), s) we can construct the non-converging 
sequence J{s) J{K{J{y)),s) J{s)- 

This construction is by no means arbitrary, as we shall see later: if a reduction sequence 

/ in the combined system "substantially rearranges" principal subtcrms then one can con- 
struct a non-converging reduction sequence in the iTRS to which the root symbol belongs, 
where the role of principal subterms is played by some fixed terms I ^ r with I ^ r. 

6.3. Beyond Ordinary ITRSs. What happens to convergence if wc remove any con- 
straints on rewrite rules, i.e. if we allow left-hand sides to be variables (expansive rules), 
and right-hand sides to contain extra variables the left-hand sides does not? 

One of the main reasons these kind of rules are excluded in finite term rewriting is that 
either kind of rule immediately breaks the strong normalisation property. Another reason 
is more pragmatic: the presence of such rules requires a very different implementation. 

Expansive rules immediately break strong convergence, but they may not break weak 
convergence. The rule x — t- F{x) on its own is weakly convergent, and if disjointly added 
to a strongly convergent iTRS the result would be weakly convergent. On the other hand, 
the presence of a second expansive rule y — )■ breaks weak convergence: x — )■ F{x) — ^ 

G{F{x)) — >■ F{G{F{x))) Similar to collapsing rules, the presence of two different 

function symbols in an expansive rule breaks convergence: in that case the right-hand side 
must have the form C[F{ti, . . . , tn)\p where the root symbol of C[ ] is different from F. The 
reduction sequence that applies the rule alternatively in positions A and p is non-converging, 
as the function symbol in position p changes every time. 

Rules with extra variables may not only be convergent, they may even be strongly 
convergent. An example for that is the rule Fix) — )• G{y). This is not only convergent on 
its own, it preserves convergence if disjointly added to any convergent iTRS. A rule like 
I —7- y, where the right-hand side is an extra variable, can only be convergent if / is a ground 
term, as otherwise two different substitution instances of / could rewrite to each other. If 
I is ground then I y can only be convergent for (metrics homeomorphic to) doo, as the 
rule can be used to gradually build terms of any infinite shape. 

But that does not mean that extra variables are unproblematic: rules with extra vari- 
ables always fail to be depth-preserving, for any term metric. They also break many proof 
techniques, as they enable the rank of terms to go up by rewriting. 

7. The Disjoint Union of iTRSs 

A signature morphism between signatures (J^i, #i) and #2) is a function u : J^i — >■ 
J-2 that preserves arities. This gives rise to a translation between finite terms a defined as 
a(F{ti, . . . ,tn)) = a{F){a{ti), . . . ,a(t„)), and o'{x) = x for a; G Var. 

A metric signature morphism between (Si, mi) and (S2,m2) is a signature morphism 
cj : El — 7- E2 such that for all F £ J^i we have: F^i = (j{F)m^. For a metric signa- 
ture morphism a the function a is a homeomorphic embedding as it preserves distances: 
dm2^{t)^^{u)) = dmi(t,u), and therefore it extends to infinitary terms. 

Notice that signature morphisms preserve the term metric m, not just the derived 
notion of the metric dm- Different term metrics can give rise to identical topologies: 

Example 7.1. If S comprises a single unary function symbol F then both term metrics 
Pid{x) = X and Fr{x) = ^/x give rise to identical metrics and therefore produce the same set 
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Ter' (S) = Ter^(S) = Ter{S), with no infinite terms at all. But, combined with another 
unary symbol G with term metric Goo{x) = x/2 the two versions of F behave differently: 
the term t = G{F{t)) exists with Fid and does not with Fy. 

An iTRS morphism from ($]i,mi,i?i) to {T,2,m2, R2) is a metric signature morphism 
a : (Si, mi) — )■ (£2,^2) such that a{Ri) C R2, where W is canonically extended to rules 
and sets of rules. 

Infinitary term rewriting systems and their morphisms form a cocomplete category. The 
disjoint union R + S of two iTRSs R and S is defined to be the coproduct in that category. 
This contains all symbols from both systems (possibly renamed if there was a conflict) 
and all rules from both systems whilst preserving the umms for all function symbols. As 
distances are preserved by iTRS morphisms we shall use the metric dm of the coproduct 
also for distances of the constituent iTRSs. 

The preservation of the metric by the injection into the coproduct also settles the 
"reflected by" part of modularity properties: 

Theorem 7.2. The properties of convergence and strong convergence are reflected by the 
disjoint union of iTRSs. 

Proof. Convergence: assume R + S is converging, let li be the coproduct injection from R 
to R + S. Any open reduction sequence / of i? is mapped by ti into an open reduction 
sequence oi R + S which by assumption is converging. As li is an isometric embedding (and 
as its domain is complete) it reflects the convergence back to /. 

Strong convergence: any counterexample to strong convergence is clearly preserved by 
Li, as the map does not affect redex positions. □ 

A signature extension of an iTRS R is the disjoint union of R with another iTRS with 
empty set of rules. 

One might expect that signature extensions preserve most properties of interest, but 
that is questionable. We can even have that a signature extension turns a strongly converg- 
ing iTRS into one that is not converging at all. 

Example 7.3. Consider the iTRS with rule F{F{x)) — )• G{x) under term metric 00 — this 
is strongly converging. If we extend the signature with a function symbol H with associated 
umm Hm{x) = min(l,2 • x) then convergence is broken: the term t = F{F{H{t))) exists 
under the combined metric, but repeatedly applying the rule to this term could only give 
us n = G{H{u)) which does not exist in that metric. 

There are two "culprits" contributing to this: (i) the signature extension used an ex- 
pansive umm, (ii) the iTRS was not depth-preserving. In this case, the counterexample can 
be reconstructed with non-expansive metrics: 

Example 7.4. Modify Example l7.3l bv setting Fm{x) = x^, Gm{x) = x, Hm{x) = min(x, i). 
In this case, neither of the two iTRSs permits infinite terms, but their combination does; 
the counterexample is preserved, as t does and u does not exist under this metric. 

8. Auxiliary definitions and observations 

When studying the disjoint union of term rewriting systems R and S it is useful to 
split a term into layers of the participating systems, and distinguish its principal subterms. 
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The principal subterm positions of a term t, PPos(t) C Pos(t) have the following properties: 
p is principal if (i) the root of t/p is a function symbol belonging to a different signature 
than the root of t; (ii) no proper prefix of p is principal. The top positions of a term t, 
TPos(i) C Pos(t) are defined as follows: TPos(t) = {p G Pos(i) \ ^q- q -< p ^ q ^ PPos(i)}. 
Thus top positions are either parallel to principal subterms or they are their prefixes. 

A principal path of t is an infinite path / where /(O) = A, /(n + 1) = f{n) ■ pn such 
that Pn G PPos(t//(n)). The modularity of convergence is tightly connected to the question 
whether principal paths that arise through rewriting are convergent. 

The top layer of a term is comprised of all the positions reachable from the root without 
a change of signature, and is thus "cut off" at its principal subterm positions. This leaves 
the layer with gaps at the cut-off points; therefore, the top layer is technically defined as 
the function that recreates a term if suitable fill material for the gaps is provided: The 
top-layer of a term t is a function [t] with type (PPos(t) — )• Ter™(S)) — )• Ter'"(S) where 
we write \t~\^ for the application of \t~\ to a function ^ : PPos(t) — )• rer™'(S). The top layer 
has the following properties: above principal subterm positions it agrees with t: 

V^VpG PPos(t).t~ [t]^ 

In positions parallel to principal subterms, t and [t] coincide: 

V^. Vg G Pos(t). (Vp G PPos(t). q\\p) => t/q = \t'\^/q 

And in principal subterm positions themselves the gaps are filled with the function ^: 

VpG PPos(t). \t\/p = i{p). 

The standard notation for layers in finite terms is C|ti, . . . , tnj- Elsewhere |19j this has 
been adapted for infinitary terms as C[tjjg/, to use some (possibly infinite) set / as indexing 
set for those positions. The main reason for departing from that is that the \t~\ notation 
avoids the need of a separate indexing set by using PPos(t) itself, as \t~\^ = t|C(p)lpGPPos{t)- 
We can define a distance between top-layers: \u\) = dm{\t\kx-, \u\kx) where kx 

is the function that constantly returns x, where x is a fresh variable. 

Lemma 8.1. Lett G rer™(S). Let f : a ^ PPos(t) Ter'^{T) he a function such that for 
each p G PPos(t) the functions gp : a ^ Ter'^{'L) defined as gp{f3) = f{j3){p) are converging 
sequences. Then h : a ^ Ter™(S) defined as h{f3) = [il/{/3) is a converging sequence. 

Proof. Suppose t is a finite term. Then PPos(t) is finite, so / is essentially an n-tuple 
of converging sequences which is also a converging sequence of n-tuples. Moreover, [t] is 
uniformly continuous, as all term- forming umms are (Proposition 5 in [9j), and as uniformly 
continuous functions preserve Cauchy-sequences the result follows. 

Otherwise (i.e. t is infinite), for any e > there is a finite term t^ such that dm{t, t^) < e. 
W.l.o.g. we can assume PPos(te) C PPos(t). Thus the previous argument applies to t^ and 
/, i.e. the function /i^ defined as /ie(/3) = [t^] j(^) is a converging sequence. Moreover, as 
subterm replacement is non-expansive (Proposition l4.ip the functions h and h^ are pointwise 
within e-distance. Thus h stays eventually within an e-ball, and — as this holds for any e 
— h must converge. □ 

From this we get a compositionality property for converging sequences: 

Corollary 8.2. Let f : a ^ Ter™(Il) be a sequence such that [/] converges to [u] and 
f Ip converges for any p G PPos(m). Then f converges. 
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Proof. We can apply Lemma 18.11 to u and f /p, giving a converging sequence /i : a — >• 
Ter™'(S) with = Moreover, the distances between h and / are pointwise 

the same as the distances between their top-layers, which means that / must converge to 
the limit of h. □ 

The (possibly infinite) rank of an infinitary term t is defined as follows: 

rank(t) = sup{l + rank{t/p) \ p £ PPos(t)} 

From now the reasoning will be about the disjoint union of two non-collapsing iTRSs 
R and S with rewrite relation -^rs and combined signature S, and for the top layer of any 
reduction (step or sequence) it is assumed that it is situated in system R, with signature 
Sr. 

In the following we will be using a construction in which all principal subterms of a 
term t at finite rank n are replaced by a fixed term u, t[n \ u]. Formally: 

t[0 \u]=u 

t[n + l\ u] = \t]^ where £,{p) = {t/p)[n \ u] 

This notation is also extended to sequences of terms: if / : a — t- Ter™'(S) is continuous 
then /[n \ : a — )■ Ter^iTj) is defined pointwise as /[n \ n](/3) = f{/3)[n \ u]. Notice 
that the function i i— )• t[n \ u] is non-expansive and therefore preserves the continuity of 
the sequence. 

Lemma 8.3. Let f : a ^ Ter™'(S) be a reduction sequence. Let u G Ter'"(S) and n £ M 
be arbitrary. Then the sequence /[n \ n] is a reduction sequence of the reflexive closure of 

Proof. The absence of collapsing rules means that principal subterm replacement at level n 
commutes with reduction steps at level m < n, even for contraction of non-left-linear redexes 
(even if rules with extra variables were permitted). Limits are also preserved because the 
operation is non-expansive. Reduction steps at lower level become reflexive steps. □ 

Any reduction sequence of the MARS (M, d, — )-^) can be turned into a (possibly shorter) 
reduction sequence of (M, — )•). Thus, the presence of reflexive steps in /[n \ u] is merely 
a technicality. The full generality of Lemma 18.31 will be used later; but of special interest is 
the case n = 1: 

Corollary 8.4. Let f : a ^ Ter'"(S) be a reduction sequence. Let u G Ter"^{Tiji). Then 
f[l \, u] is a reduction sequence of the reflexive closure of ^r. 

Proof. As all terms of the sequence are by construction S/j-terms each — )-R5-step is a — s-r- 
step. □ 

We define the principal positions of a reduction sequence / : a — )• Ter'^(S) as follows: 

ppos(/) = u n pp°s(/(7)) 

f5<al3<j<a 

Lemma 8.5. Let f : a ^ rer™'(S) be an open reduction sequence. If R is convergent then 
the top layers of f converge to some top layer [u] where PPos(?x) = PPos(/). 
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Proof. Since the iTRS of the top layer is convergent then so is /[I \ x], for some fresh 
variable x, and let its limit be t. Let c be any non-variable term in the other iTRS then by 
substitutivity of weak convergence we also have /[I \ x][c/x] = /[I \ c] ^[a^/c], and 
we can set u = t[x/c] since \u] does not depend on the choice of c. Clearly, any principal 
position in t[x/c] must be principal for \ c] for all 7 in the open interval a) for 

some 13. □ 



9. Modularity of convergence 

In Example l6.3l we have seen how a top-layer can through rearrangement of its principal 
subterms interfere with the convergence at subterm positions. However, the example that 
rearranged principal subterms "in a substantial way" was divergent. 

We will show that this is not an accident. The idea behind this is the following: if / 
is an open reduction sequence then so is 51 = /[I \ /] and if Hs a term belonging to the 
(converging) top-layer then g must converge. Moreover, if / — )• r (in the interesting case 
/ ^ r) then g must still converge if we interleave it with the occasional reduction of Z — >• r, 
provided we preserve non-left-linear redexes along the way. 

It is impossible to have both I and r occur infinitely often in a fixed position p, because 
that would create a divergence with diameter of at least doo{l,r) ■ 2~l^l in Ter°°($]), which 
by Proposition 14.41 gives divergence for Ter"^{T,) as well. That means that two situations 
are ruled out: (i) that the top-layer reduction drags eventually infinitely many different 
subterms into position p — as in / [I \ ^] each I could be reduced once it appears in position 
p; (ii) at the same time infinitely many terms appearing in position p are "descendants" of 
a principal subterm t and infinitely many others are not. In that case we would create a 
non-converging sequence by replacing all weak reducts of t in / by r, and all other principal 
subterms by I. With both these scenarios ruled out one can show that the sequence of 
subterms in position p must be a focussed sequence. 

These ideas will be formalised in a slightly different way that avoids the tracing of 
redexes and the rather cumbersome notion of "descendant" — which is impossible anyway 
for non-strong reductions |18]. 

9.1. Predicate Sequences. A predicate sequence is an ordinal-indexed sequence of pred- 
icates s on terms such that /3 < 7 A t ->^w u A s{'~f){u) s{/3){t). A predicate sequence is 
meant to be used alongside a reduction sequence of the same length, w.r.t. the same sig- 
nature and term metric. The idea behind predicate sequences is that they are used as a 
tool to decide when to reduce an Z to r in /[I \ Z]; or rather: some principal subterms 
will be replaced by I, some by r, and the construction makes sure that reduction steps are 
preserved. So the motivation behind the definition of a predicate sequence is this: once we 
have decided to replace a principal subterm t by r (rather than I) then (i) we cannot change 
our mind later, i.e. for a larger indexing ordinal, and (ii) all "descendants" of t need to be 
replaced by r too, which we can safely over-approximate as all weak reducts of t. 

Let f : a ^ Ter'^{'E) be a reduction sequence and p be an w-sequence of positive 
integers. The function fp:a^ Ter™(S) — )• B maps ordinals to predicates on terms, and 
is defined as follows: 

/p(/3)(t) ^37./3<7<aApG PPos(/(7)) A t f{^)/p 
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Thus the predicate fp is true for terms that weakly reduce to terms which appear in principal 
position p later in the sequence. 

Lemma 9.1. fp is a predicate sequence. 

Proof. Assume P < j and t u and fp{'y){u). fp{^){u) means that there is a C > 7 such 
that j» is a principal position in /(C) and u f{C)/p- In order to show that fp{f3){t) we 
can choose the same ^ as the witness; /? < C holds because P < j < C t f{C)/P holds 
because t -^^i u f{C)/p, and is transitive. □ 

Explanation: the function fp watches the subterms in position p, where the ordinal 
parameter can be seen as a notion of time; fp is true for a term if a weak reduct of the term 
will "in the future" appear in position p. The only way fp can (in time) switch from true 
to false is when at some point in the reduction sequence the term in position p is replaced 
by another that is not a reduct of the previous. This is only possible if a redex in a position 
above p is contracted, i.e. a redex of the top layer; in the simulated sequence this redex can 
be employed to replace an r in position p by an / taken from elsewhere. 

Another predicate sequence we shall be using is not dependent on the ordinal, and 
defined as: kt{P){u) -i(t u). Thus kt is constantly true for all terms that are not 

weak reducts of t. 

Lemma 9.2. kt is a predicate sequence. 

Proof. Let r s and kt{'j){s). Thus -i(t s). To show kt{P){r) we need to show 
-i(t -^w r). But t -^w r gives t r s and so by transitivity of -^w the contradiction 
t s. □ 

Given a rule I r in R and a predicate sequence s we define a dependent function 
: m : rer"'(S).a ^ PPos(t) ^ rer'"(S) as follows: 



ifs{/3){t/p) 
otherwise 



Explanation: the reason this is a dependent function is that the type of the third parameter 
PPos(t) depends on the first parameter, the term t. Thus ^s(i)(7) is a function of type 
PPos(t) Ter'"(S), which is exactly the type of function needed to fill the holes in \t~\. 

Proposition 9.3. Distributive properties of w.r.t. subterm selection and substitution 
application in top positions: 

Vp G TPos(t). \t]^^(^tm/p = \t/p]ut/pm 

\fu G Ter"\^R). ra(u)]^^(5.(„))(^) = cj'(u) 

where Vx. a'{x) = [cr(a;)]5^(^(^))(^) 

Proof. The first property is trivially proven by induction on the length of p. The second is 
a corollary, because if u G Ter"^{T,ji) (i.e. only using function symbols from the signature 
of the top layer) then all variable positions in u become top positions in a{u). □ 
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The function is used to create reducts of t[l \ /] by reducing some of the / to r, 
guided by the predicate sequence. In particular, it is used to replace principal subterms in 
a reduction sequence. 

Proposition 9.4. The fundamental properties of the function are: 

t -^Rs u rtl^4t)(/3) -^R \u]^s(u)ii3) 

Proof. First property: let q be the redex position. At positions p parallel to q, t ^ u thus 
also \t]^^(t)(i3) ~ 

li p ^ q for some p G PPos(t) then certainly p £ PPos(u) and and M^j,(n)(/5) 

also coincide in all positions that are prefixes of p. At position p itself these two terms have 
either I or r. If ^s(n)(/3)(p) = r then there is nothing to show, as both / — t-^ r and r — r. 
If ^s{u){f3){p) = I then we know by the definition of that s{l3){u/p) holds. Since we know 
t/p -^RS u/p we can use the predicate sequence property of s to establish s{/3){t/p). This 
means by the definition of that £,sit){/3){p) = I, and therefore = as 

the terms also agree in position p. 

Otherwise the redex g is a top position, t/q = cr(w), u/q = a{w) for some rule v ^ w m 
R. Using Proposition 19.31 we get: = C'[a'{v)] and \u]^^(u){^) = C'[a'{w)] for some 

context C and some substitution a'. 

So in particular: \t]^^(t){i3) = C'[a'{v)] ^r C'[a'{w)] = \u]^^(_um. 

The reason for the second property is that <;= 5(7); consequently, fewer (but not 
more) principal subterms might be set to I; again, all these can be reduced to r — this 
might require u steps, if the top layer of t is infinite. □ 

Given an open reduction sequence f : a ^ Ter™($]) and a predicate sequence s we 
write \f~\s ■ OL — )• Ter™'(S) for the function defined as follows (here n is a finite ordinal and 
A a limit ordinal or 0): 

[/]s(A + 2 • n) = [/(A + 
[/]3(A + 2 • n + 1) = [/(A + 
This definition may require some explanation 

(1) Because / is open, a is a limit ordinal and thus A + n < a implies A + n + n + l<a, 
for finite n. Hence a remains as the domain for 

(2) That the case distinction into even and odd ordinals is well-defined and covers all cases 
follows from the normal- form theorem for ordinals page 323]. 

(3) The purpose of this definition is the following. A single reduction step /(/3) — >• /(/3 + 1) 
of the original sequence is split into two stages: 

r/(/3)i6(/(/3))(/3) r/(/3)i6(/(/3))(/3+i) r/(/3 + 

This follows from Proposition [931 Thus the function \f^s is a weak reduction sequence. 

(4) As all terms in [/] ^ belong to system R the sequence must be convergent — the same 
argument also explains why [/]« must be continuous at limit ordinals A < a, i.e. why 
it is a proper sequence. 

We can illustrate the construction [/] ^ more concretely on an example. 

Example 9.5. Consider the two iTRS from Example l6.3| they allowed to eventually rewrite 
J{t) where t = K{E, t) to J{u) where u = K{S°°,u). One way of doing this in a; -|- a; steps 
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is by first rewriting t using rules of the first system: t ->^w K{Z, t) -^w K{Z, K{S{z),t)) -^w 

K{Z,K{S{Z),K{S{S{Z)),t))) -»-yj After reaching a hmit at lo we can apply the single 

rule of the other system repeatedly at the root: if Un = K (S'^ (Z) , Un+i) then J{un) — >■ 
J{un+i)-, reaching J{u) in the limit. 

If this is our sequence / we can form e.g. for p = 1 • 1. The initial term of 

that sequence would be to = J{(^q) where ag = K{l,aQ) — for I we can choose J{K{x,y)) 
as it is a left-hand side of a non-trivial rewrite rule. Rewriting E to Z in position p 
does not change that, because all principal subterms can still rewrite to that term, so 
ti = to- Once we rewrite the other occurrences of E with the rule E H{E) they no 
longer weakly reduce to Z, but they still weakly reduce to some term that appears in 
position p in the second half of sequence /. Therefore, \f~\fp is constantly to for the first 
cj steps. The step f{uj) = J{uq) — ^ J{ui) = f{oj + 1) is split in into two steps: 

\f]f^{uj) = J(ao) J{K{r,ao)) — )■ J(ao) = \f]f^{uj + 2). The first step rewrites the 
topmost / to r, because none of the terms appearing in position p in sequence / henceforth 
is a weak reduct of Z. All the other principal subterms will still appear in p and thus they 
will remain I in \f]fp. For all subsequent steps exactly the same happens, as S{Z), S{S{Z)), 
etc. will eventually appear a last time in p. Therefore, in the second half of [/] the values 
always alternate between J(ao) and J{K{r,ao)), which makes \f~\fp non-convergent. 

The reason a convergent reduction was mapped by this construction to a non-convergent 
one is that / is convergent "in the wrong way" , i.e. its principal subterms in p do not form 
a focussed sequence, which is only possible if the system of the top layer is itself non- 
convergent. 

9.2. General Observations. 

Lemma 9.6. Let f : a ^ Ter™(S) be an open reduction sequence and let R he convergent. 
Let p G PPos(/). Then the sequence f /p is focussed. 

Proof. First notice that \f]fp converges. By construction, that sequence has an I in po- 
sition p for all even ordinals (including limits), but it reduces that I to r whenever the 
corresponding subterm in / fails to have any more reducts in position p. This cannot hap- 
pen arbitrarily close to a, so there is a cut-off point /? after which all f{'y)/p have reducts 
f{K)/p, for some «; > 7. 

Now assume, for some C, > 13 there was no cut-off point (j) such that for all ip > (p, 
fiC)/p /(V')/P- Then we could construct the weak reduction sequence \ f~\kf(^t;)/p 
This replaces all weak reducts of f{C)/p in principal positions by r and all other principal 
subterms by I. By the assumption and the previous observation we have both I and r 
occurring at p arbitrarily close to a which contradicts convergence. Hence the assumption 
was wrong and / must be focussed. □ 

This result means together with the previous propositions about focussed sequences that 
if reductions in principal subterm positions would converge then so would the reduction as 
a whole. But why would they converge? We certainly have the following: 

Theorem 9.7. // two non-collapsing iTRSs R and S are convergent and f is a reduction 
sequence of their disjoint union then /[n \ s] is convergent, for any n and s. 

Proof. Convergence is proved by induction on n. 
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For each p £ PPos(/) the sequence f /p is focussed by Lemma \9M and so is /[A; + 1 \ 
s]/p = f /p[k \ s\. Thus by the induction hypothesis and Proposition 13.81 these sequences 
must converge. This means we can apply Corollary 18.21 establishing the result. □ 

Example 9.8. Consider the iTRS under term metric oo with rule F{F{x)) — >• G{x), com- 
bined with a rule-less iTRS with function symbol H and term metric Hfi{x) = min(l, 2 • x). 
We can construct the infinite term t = H{F{F{t))) in the combined metric. We can also 
construct an open reduction sequence g of length uj by starting with t, always reducing 
its topmost redex. By Theorem 19.71 each g[n \ s] is converging, but g itself is not, the 
distances between g{k) and g{k + 1) always remain 1. 

In Example 19.81 the sequence was diverging, because the infinite term u = H{G{u)) 
(which exists in Ter°°{Ti)) towards which the reduction was seemingly heading did not 
exist in Ter'"(S), as its finite approximants do not form Cauchy-sequences under metric 
dm- This is always the case: 

Proposition 9.9. If two non- collapsing iTRSs R and S are convergent and f is a reduction 
sequence of their disjoint union then f converges under the metric doo- 

Proof. From Theorem 19.71 we know that f[n \ s] is converging (under metric dm), and 
by Proposition 14.41 we also know that it is Cauchy under metric doo. Under that metric 
the sequences / and /[?^ \ s] are pointwise within distance of 2~", and therefore / must 
eventually stay within a ball with diameter 2~". As this holds for any n, / must be Cauchy 
(and by completeness: converging) under metric doo- CH 

Corollary 9.10. Convergence is a modular property for non- collapsing iTRSs over metric 
doo- □ 

This slightly extends the result from [lOj as the proof here also covers generalised iTRSs 
with extra variables. Another consequence of Propositions 19.9 1 and 14.6] is that the question 
whether such a sequence / is converging under metric dm reduces to the question whether 
its limit t under doo is a term in Ter™(S), and by Proposition 14.81 this is equivalent to 
whether all its infinite paths converge. However, infinite paths that traverse through only 
finitely many signature changes would be contained in the positions of some t[n \ s] and 
therefore must converge. Thus it suffices to only consider principal paths. 

9.3. Conditions on the Metric. In general, the interplay between the umms of the con- 
stituent term metrics can be quite subtle. As Example 17.41 shows, it is even possible that 
only this interplay gives rise to infinite terms. In this section we are looking at metrics 
which are much curtailed in that respect. 

A term metric g is called granular iff each component Fi^g of the umm of each function 
symbol F is either the identity function or the halving function. Clearly, when we form 
the disjoint union of two iTRSs with granular term metrics the resulting term metric is 
granular. 

As a special case oo is granular. Moreover, as indicated in the Introduction, the term 
universe associated with algebraic types in Haskell with strictness annotations is expressible 
through granular term metrics. 
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Granular term metrics have comparatively simple convergence properties: a full infinite 
path f oit E Ter°°(Ti) is converging w.r.t. term metric g iff it contains infinitely many non- 
strict positions. If / is a path of t then we define the step function as follows: 



Although that definition would technically carry over for other term metrics, for granular 
term metrics the step-function tells us about the rate of convergence. 

Collapsing rules cause problems in term metric oo for two separate reasons: 

• they allow to make layers merge, so that computations originating in a lower layer 

can impact on a higher layer; 

• the top layer can disappear altogether, so that in terms of overall convergence the 
computation has made no progress up to the point where that layer collapsed. 

With granular term metrics there is a different kind of rule that recreates the second prob- 
lematic aspect of collapsing rules. This happens with a rule / r when for some variable 
X, vdepth{x,r, g){l) = 1 and vdepth{x,l,g){l) < 1; such rules are called pseudo-collapsing. 
Note that if all argument positions are lazy {g = oo) this is only possible if Z — >■ r is a 
collapsing rule. 

Although a pseudo-collapsing rule cannot make a layer itself disappear, it can make it 
disappear '"scmantically'" , i.e. regarding distances. This has an immediate consequence on 
convergence: 

Proposition 9.11. If a non-collapsing iTRS contains a pseudo-collapsing rule then it can- 
not be convergent. 

Proof. Let Z — )• r be that rule and vdepth{x,r, g){l) = 1 and vdepth{x,l, g){l) < 1. We can 
foim the infinite term t = l[t/x], because V depth {x, I, g){l) < 1. For the rewrite step r [t/.T] — >■ 
r[r[t/a;]/a;] we notice that dg{r[t/x],r[r[t/x]/x]) = dg{t,r[t/x]) because vdepth(x,r, g){l) = 
1. This means the rewrite sequence commencing with t has identical distances in each 
rewrite step. This can only be convergent if they are all 0, which means that the substitution 
t/x must a unifier for I and r. This in turn is only possible if r = x which is ruled out by 
the non-collapsing condition. □ 

As pseudo-collapsing rules break convergence we know that such rules must be absent 
when we combine two non-collapsing convergent iTRSs (over term metric g) . Their absence 
also gives an impact on an invariant of the step-function: 

Lemma 9.12. Let t u in the disjoint union of 2 non-collapsing iTRSs with joint 
term metric g. For every principal path q of u there is a principal path p of t such that 
Vj. stepg(n,g,j) > stcp^(t,p, j). 

Proof. Single rewrite step: If g is a principal path not crossing the redex position then q is 
also in t and the result is immediate. Otherwise there is an i such that q{i) is a position 
above the contractum, and q{i + 1) = q{i) ■ v the position v is below a variable position of 
some variable y in the contractum. On the left-hand side of the step there is a path p sharing 
the first i positions with q, where p{i + l) = p{i)-w such that w; is a (depth-minimal) position 
of y in the redex. As the applied rule is not pseudo-collapsing, if {t,p{i))g ^ {t,p{i -\- l))g 
then we also must have {u,q{i))g ^ {u,q{i -\- l))g. 



stepg{tj,0) 
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The transitive closure clearly preserves the property. Transfinite reductions do not 
change it either as (in the absence of rules with extra variables) limits cannot introduce 
new principal paths. □ 

What Lemma [9.12l savs is that the rate of convergence cannot go down when we rewrite, 
provided we measure it in numbers of layers (rather than actual distances). 

Theorem 9.13. The disjoint union of non-collapsing iTRSs with granular term metrics 
preserves convergence. 

Proof. Let / be a reduction sequence in that disjoint union. We show that for any e > 
there is an n such that pointwise dm{f{'y),f{'y)[n' \ x]) < e; by convergence of f[n \ x] 
this implies that / stays eventually with an e-ball, and as this holds for any e, / converges. 

We can find the n as follows: first notice that for some k, e > . We need to find 
an n such that step(/(0),p, n) > k for every principal path p of /(O). As /(O) G Ter™(Il), 
every principal path p in it is converging which implies step(/(0),p, n) > k for some n, and 
as /(O) is a finitely branching tree, some n will do for all. 

By Lemma l9 . 1 2 1 this property is preserved by rewriting: if t u and step(t,p, n) > k 
for all principal paths p oi t then also step(w,(?,n) > k for all principal paths q of u. This 
also implies that u and n[n \ x] are within distance of each other. □ 



10. Modularity of strong convergence 

Simonsen showed [fl9j that top-termination (and thus strong convergence) is a modular 
property of left-linear iTRSs under term metric co. Left-linearity is indeed an unnecessary 
constraint: 

Theorem 10.1. Over term metric oo, the disjoint union of two strongly converging non- 
collapsing iTRSs is a strongly converging iTRS. 

Proof. Strong convergence implies that the contributing iTRSs are non-collapsing, hence the 
previous results about convergence apply. According to |24) an iTRS is strongly converging 
if and only if it contains no reduction sequence that reduces a root redex infinitely often. 
Suppose / was such a sequence. But /[I \ x] resides in the iTRS R of the top layer; as the 
construction preserves all top layer reductions it also preserves all root reductions which 
contradicts the premise that R is strongly converging. □ 

This statement does not even need a side condition banning non-collapsing rules, since 
these break the strong convergence property on the individual iTRSs and are therefore 
known to be absent. For other metrics this is no longer true: term metric id (all components 
of all umms are the identity function) rules out infinite terms and makes strong convergence 
coincide with strong normalisation. 

Theorem 10.2. Let R and S he non- collapsing strongly convergent iTRSs. If R + S is 
convergent then it is strongly convergent. 



Proof. The same argument as in the proof of Theorem 110.11 applies, i.e. if / is not top- 
terminating then /[I \ x] is not either. □ 
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11. Conjectures and Open Problems 

I would conjecture that the disjoint union of a convergent iTRS with a strongly conver- 
gent one is convergent (under term metric c«). If the (merely) convergent iTRS contained 
no cohapsing rule then this follows already from the results in |10j , but it may hold without 
that proviso. What makes a proof for this conjecture technically awkward is that collaps- 
ing rules allow layers to merge. However, this capability is highly restricted in converging 
systems. 

Convergence requires that reduction sequences of any length converge. A related prop- 
erty is a-convergence, which means that reduction sequences up to length a converge, while 
longer ones may not. Example 16.21 is not convergent in general, but it is w x n-convergent, 
for finite n. It depends on the ordinal a whether the constructions from this paper would 
still work unchanged. 

The most interesting special case is w-convergence, and there is a problem with con- 
structing the simulated reduction sequence relative to a predicate sequence: when "time is 
moved on" the simulation potentially requires w-many steps to rewrite potentially infinitely 
many occurrences of / to r. This cannot be done if the overall length of the simulated 
sequence is limited to u. Finitely many steps would suffice to move on time if all rules were 
finitely linear, which means that variables only occur a finite number of times on both left- 
and right-hand sides of rules. While necessary to repair the proof it is unclear whether that 
constraint is necessary to repair the theorem: are there tj-convergent iTRSs that are not 
finitely linear and whose disjoint union fails to be cj-convergent? 

12. Conclusion 

For the standard metric d^o it has been shown that both weak and strong convergence 
are modular properties of non-collapsing iTRSs. Otherwise, the iTRSs involved can have 
various usually undesirable properties: non-left-linear rules, infinite left-hand sides, and 
infinitely many rules are all permitted. 

The proof for strong convergence is perfectly straightforward, the proof for weak con- 
vergence is not. At the heart of the latter is a consideration that the reductions in the top 
layer are limited in the way they can rearrange the principal subterms, in particular that the 
principal subterms in a fixed position must form a focussed sequence. Moreover, abstract 
reduction systems are converging iff their focussed sequences do, and both observations 
together give that result. 

For other metrics, these arguments still apply to reductions of finite rank (Theorem 19. 7p . 
while for reductions of infinite rank there are counterexamples. As a general proof strategy, 
various convergence arguments have been constructed along the central idea of limiting the 
pointwise distance between the reduction sequence under investigation and a finite rank 
approximation of that sequence. These allow to generalise the modularity of convergence 
and strong convergence of non-collapsing iTRS beyond metric doo, to systems with either 
granular term metrics or (not elaborated here) arbitrary metrics but depth-preserving rules. 
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